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DECOMPOSITION TABLES FOR EXPERIMENTS I. 
A CHAIN OF RANDOMIZATIONS 

By C. J. Brien and R. A. Bailey 
University of South Australia and Queen Mary, University of London 

One aspect of evaluating the design for an experiment is the 
discovery of the relationships between subspaces of the data space. 
Initially we establish the notation and methods for evaluating an 
experiment with a single randomization. Starting with two struc- 
tures, or orthogonal decompositions of the data space, we describe 
how to combine them to form the overall decomposition for a single- 
randomization experiment that is "structure balanced." The relation- 
ships between the two structures are characterized using efficiency 
factors. The decomposition is encapsulated in a decomposition table. 
Then, for experiments that involve multiple randomizations forming a 
chain, we take several structures that pairwise are structure balanced 
and combine them to establish the form of the orthogonal decomposi- 
tion for the experiment. In particular, it is proven that the properties 
of the design for such an experiment are derived in a straightforward 
manner from those of the individual designs. We show how to formu- 
late an extended decomposition table giving the sources of variation, 
their relationships and their degrees of freedom, so that competing 
designs can be evaluated. 

1. Introduction. This paper investigates methods for evaluating designs 
for experiments with multiple randomizations [14] that follow each other in 
a chain, like those in Figures 1 and 2. In general, the form of the analysis- 
of- variance table for an experiment is informative when evaluating a design 
for it. The foundation of this table is the orthogonal decomposition of the 
data space that it reflects. Hence the purpose of this paper is to establish 
the appropriate orthogonal decomposition of the data space for some ex- 
periments with multiple randomizations. This requires an extension of the 
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2 Rosemary 

3 Irradiation 



6 treatments 



3 Blocks 

6 Meatloaves in B 
18 meatloaves 



3 Sessions 
12 Panellists in S 
6 Time-orders in S 

216 tastings 



Fig. 1. Randomization diagram for Example 1: treatments are randomized to meatloaves, 
which are in turn randomized to tastings; B denotes Blocks, S denotes Sessions. 



decomposition for standard textbook designs, almost all of which involve a 
single randomization. 

Example 1 (Meatloaves). At the Joint Statistical Meetings in New York 
in 2003, T. B. Bailey described the two-phase sensory experiment shown in 
Figure 1. This figure is repeated from [14], where the conventions used in 
such diagrams are explained. In the first phase of the experiment, there were 
six treatments, consisting of all combinations of a two-level factor Rosemary 
(present or not) with three quantities of irradiation. These treatments were 
applied to the process of making meatloaves, using a randomized complete- 
block design with three blocks. In the second phase, blocks were random- 
ized to sessions. In each session, twelve panellists tasted a portion of all six 
meatloaves from the block assigned to that session. The design in this phase 
consisted of a pair of 6 x 6 Latin squares in each session. The combined effect 
of these two randomizations is to randomize treatments to tastings. 

The design for each phase of this experiment is orthogonal, in a sense 
that we make precise in Section 4. It should therefore be straightforward to 
obtain the appropriate decomposition. However, as T. B. Bailey reported, 
the analysis of variance given for similar two-phase experiments is frequently 
wrong. 

To obtain the decomposition for experiments with a chain of random- 
izations, we have to assume that the design for each single randomization 
satisfies some conditions, which are explained in Section 4. Although they 
are rather technical, they are satisfied by many designs used in practice. The 
decomposition that we obtain works just as well for complicated nonorthog- 
onal designs like the one in Figure 2 as it does for orthogonal designs. 

Our approach extends that of Fisher [20], Wilk and Kempthorne [38], 
Nelder [28, 29] and Bailey [1, 9] for a single randomization. The objective of 
a randomization is to assign one set of objects, here designated T, to another 
set of objects, here designated 0. Frequently T is the set of treatments and 
f2 is the set of observational units. In Example 1, the set described by the left- 
hand panel consists of six treatments while the set described by the middle 
panel consists of 18 meatloaves. The randomization of the 18 meatloaves 
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to the 216 tastings is also considered to be a single randomization, even 
though there are multiple arrows between their panels. This is because it 
can be achieved by selecting a single permutation of the 216 tastings, as 
explained by Brien and Bailey [14]. 

In Section 3, we define structure on a set to be an orthogonal decomposi- 
tion of the relevant vector space. The structure on the set of treatments in 
Example 1 consists of the spaces for the grand mean, two main effects and 
their interaction; the structure on the meatloaves consists of the spaces for 
the grand mean, differences between blocks and within-block differences. In 
general, the structure on f2 is the unrandomized structure, which reflects the 
topographical, managerial or physical features that are there before treat- 
ments are assigned. 

Although our main results apply to quite general structures, we limit our 
examples to structures which are uniquely defined by a list of factors, their 
numbers of levels and their nesting relationships. Such a structure can be 
succinctly shown in a panel. In the right-hand panel in Figure 1, Panellists 
and Time-Orders are both nested in Sessions, in the sense of [28] and [14]: 
thus there are 36 panellists in total, not twelve. 

Brien [11] introduced the term tier to describe the collection of factors 
on a set before it is associated with another set by randomization. Most 
standard textbook designs have two tiers. Brien and Bailey [14] introduced 
randomization diagrams to summarize such designs. There are two panels, 
one for each tier. Underneath each tier is written the corresponding set; this 
may be either a symbol, such as T or 0, or a defining phrase, like "6 treat- 
ments." Arrows go from the randomized tier (on T) to the unrandomized 
tier (on SI). 

The decomposition for such a design is obtained by starting with the 
structure on and then refining it by the structure on T, according to 
the relationship between the two structures. The decomposition consists of 
the set of orthogonal subspaces of the data space which is characterized 
by the set of mutually orthogonally idempotents derived from those of the 
two structures. It is displayed in a decomposition table which has rows and 
columns. Each row corresponds to one of the subspaces in the decomposition: 
some authors call this a "source of variation." The number of degrees of 
freedom shown in this row is just the dimension of the subspace. These tables 
also show efficiency factors, that we define in Section 4. Bailey [9] gives 
such decompositions for orthogonal designs, calling the ensuing table the 
skeleton analysis of variance. Most other textbooks on design and analysis 
of experiments do not do this. Instead, it is usual to write down a model 
of terms thought to be relevant, which are then listed in an ANOVA table. 
This method has several disadvantages: the resulting model is somewhat 
arbitrary [38] ; the analysis does not reflect the confounding arising from the 
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randomization; and, for nonorthogonal designs, non-unique partial analyses 
may be produced [15]. 

Experiments with two or more randomizations involve three or more tiers, 
so Brien [11] termed them multitiered experiments. Three or more panels 
are required in their randomization diagrams. As noted in [14], multitiered 
experiments include two-phase, some superimposed and some single-stage 
experiments, and some multistage experiments which use the same units 
at each stage; they do not represent a collection of new designs, but are a 
class of designs made up of several existing design types. However, general 
methods for assessing their properties have not been established. 

This paper considers those multitiered experiments where the arrows in 
the randomization diagram follow each other in a chain. This commonly oc- 
curs in two-phase experiments [27], particularly those that include a second, 
laboratory phase after an initial phase: examples are sensory experiments, 
field experiments followed by laboratory processing, and gene-expression mi- 
croarray studies. In the simplest case, there are two randomizations and 
three sets of objects, f2, T and T, each with a tier of factors and a structure: 
r is randomized to T, and T is randomized to 0. Each randomization may 
employ a standard textbook design. In [14] , such a pair of randomizations 
is called randomized-inclusive if the systematic design for the allocation of 
T to must use information from the outcome of the randomization of T 
to T; otherwise the pair is composed, as in Example 1. 

Section 2 uses a nonorthogonal two-tiered experiment to demonstrate the 
approach of using the decomposition table to evaluate the design. It then 
poses the problem of how to obtain the decomposition table for a three- 
tiered experiment. Section 3 makes precise what we mean by structure on a 
single set of objects. Section 4 outlines the relevant concepts for an exper- 
iment with a single randomization, in order to make them precise and to 
establish notation. In particular, we show how to take two structures and 
combine them to produce a single decomposition table for a two-tiered ex- 
periment. Sections 5 and 6 extend these results to obtain the decomposition 
table for experiments involving a pair of randomizations that are either com- 
posed or randomized-inclusive. Section 7 generalizes this to longer chains of 
randomizations. 

The analysis-of-variance table for the analysis of a response variable is 
an extension of the decomposition table. It includes a sum of squares in 
each row. This has the form y'Dy where y is the data vector and D is 
the idempotent matrix of rank d which projects onto the subspace in the 
decomposition for the row. The subspace is variously called the image of 
D, written ImD, or the column space of D. The mean square, y'Dy/d, is 
usually shown. Sometimes the expected value of this mean square, based on 
either a randomization model or a mixed model for the response variable, 
is also included. In this paper, we are not attempting to obtain the whole 
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4 Trellis - 
2 Methods - 



treatments 



Squares 

3 Rows 

4 Columns in Q 



■ 2 Halfplots in Q, R, C 



half-plots 




-2 Occasions 
3 Intervals in O 



" 3 j2 '>t-6 Judges 



4 Sittings in O, I 
4 Positions in O, I, S, J 
576 evaluations 



Fig. 2. Randomization diagram for Example 2: treatments are randomized to half-plots, 
which are, in turn, randomized to evaluations; Q, R, C, O, I, S, J denote Squares, Rows, 
Columns, Occasions, Intervals, Sittings and Judges, respectively. 



analysis-of-variance table, merely the appropriate orthogonal decomposition 
of the data space so that the properties of an experimental design can be 
established. In particular, we have not given the expected mean squares, 
which are often of use in assessing a design. However, for the randomization 
model, they are easily deduced from the decomposition table, and so the 
likely relative sizes of variances of effects under this model can be gauged. 

2. Evaluating the design for one experiment. In designing a standard 
two-tiered experiment, one considers the sources of natural variation, the 
sources with which the treatments are confounded and the amount of in- 
formation about different treatment contrasts available from the different 
sources. 



Example 2 (A viticultural experiment). Brien and Payne [15] describe 
a viticultural experiment comprising the field phase of a two-phase experi- 
ment. Two adjacent Youden squares are used to assign trellis treatments to 
the plots, a plot being a row-column combination within a square. Each plot 
is divided into two half-plots, to which two methods of pruning are assigned 
at random. There are two sets of objects involved in this phase, half-plots 
and treatments, with corresponding panels shown in the middle and left of 
Figure 2. In this example, Q will be the single letter used for Squares to 
avoid a conflict with Sittings. 

When this phase of the experiment is being planned, the next step is 
to establish properties of the design by obtaining the decomposition in 
Table 1 (further details about its derivation are in Section 4). The left- 
hand side of this table contains what Fisher [20] called the "topographi- 
cal analysis;" it reflects the unrandomized structure on the half-plots. The 
other half of the table gives the randomized structure derived from the 
treatments. The table also shows the relationship between the two struc- 
tures. Trellis is partially confounded with the sources Columns [Squares] and 
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Rows#Columns[Squares], with efficiency factors 1/9 and 8/9, respectively. 
Method and Trellis^Method are both totally confounded with 
Halfplots [Squares A Rows A Columns] . Hence, in a randomization model, the 
expected mean square for the Residual in Rows#Columns[Squares], with 
nine degrees of freedom, is equal to the variance £rc[Q] of the Row-Column 
interaction within Squares. Further, the variance of a difference between two 
types of trellis, if estimated solely from contrasts in Rows#Columns[Squares] , 
that is, only from contrasts between Row-Column combinations within 
Squares that are orthogonal to contrasts for both Rows and Columns within 
Squares, is (2/12) x (9/8)£ R c[qj- Then the designer may be satisfied that 
most of the information about types of trellis is confounded with a source 
from which many larger sources of variation have been eliminated. 

In the subsequent sensory phase, the half-plots from the first phase are 
randomized, using two Latin squares and an extended Youden design, to 
glasses in positions on a table for evaluation by judges. This sensory phase 
adds a third set of objects, evaluations, with associated panel shown on 
the right of Figure 2. What are the properties of this two-phase experi- 
ment, whose randomizations are composed? We would like to establish the 
decomposition table for the full experiment. How should one combine the 
properties of the second-phase design with those of the first-phase design to 
yield the decomposition table for the final two-phase design? 

Table 1 
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Decomposition tables conveniently summarize the properties of designs, 
so they are useful in evaluating potential designs. Such tables go back to, at 
least, Fisher [21], Table 29. They are closely related to the skeleton analysis- 
of-variance tables that are given in [9] and that GenStat [32] produces in 
response to the ANOVA directive with no data. In GenStat tables, sources are 
relabelled to include only original factors (not pseudofactors) , along with 
their interacting and nesting factors, while the efficiency factors (other than 
and 1) are listed below the table. Similar tables can be produced with 
the programs R [36] and S-Plus [25] by analysing randomly generated data. 
These analysis-of-variance tables have separate subtables for each unran- 
domized source, such as sources for the half-plots tier in Example 2; these 
sources are labelled strata in GenStat and [9]. Federer [19] also gives separate 
subtables, with headings such as "main-plot analysis." 

The pair of randomizations in Example 2 form a chain. In what order 
should we do the decompositions for such randomizations: (a) start with 
the decomposition of the half-plots tier, refine it by the decomposition of 
the treatments tier as in Table 1 and then use this refinement to refine the 
decomposition of the evaluations tier, or (b) start with decomposition of the 
evaluations tier, refine it by the decomposition of the half-plots tier and then 
refine this refinement using the decomposition of the treatments tier? These 
questions are addressed in full in Sections 5 and 6. 

3. Structure in a single tier. Given a set f2 of objects, let Vh be the 
space of all real vectors indexed by fi. This space is also written as M. Q and 
its dimension is If the elements of are . . . ,u n , then a vector in Vh 
has the form (v Ul , . . . , v Un ) . The advantages of using Q to index the space 
Vh are: (i) we do not need to agree on a convention for the order of writing 
the elements of f2; and (ii) if two sets f2 and T have the same size, we can 
still distinguish between Vh and Vy because they are indexed by different 
labels. 

Following Bailey [4], we interpret structure on f2 to mean an orthogonal 
decomposition of the space Vh- Such a decomposition is specified by the set V 
of symmetric, idempotent, mutually orthogonal matrices projecting onto the 
subspaces of Vh in the decomposition. Thus if P € V then P is symmetric 
and idempotent; if Pi and P2 are in V with Pi 7^ P2 then P1P2 = 0; and 
X^Pe'pf = IP) the identity matrix on Vq. We shall identify such a set V 
of matrices with the corresponding orthogonal decomposition of V^. Each 
of these matrices has |0| rows and columns, which are labelled by the 
elements of f2, so it is called an Q x Q matrix. As explained in [7], this means 
that we do not have to agree on an order for writing down the elements of fi, 
so long as we show the labels for the rows and columns. 

In our examples, we restrict attention to structures in which each set is 
uniquely indexed by a set of factors, referred to as a tier. The notation and 
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conventions for such tiers are given in [14], Section 2.2. We use F\ A • • • A F n 
to denote the generalized factor whose levels are the levels combinations 
of F\, F2, ■ ■ ■ and F n , for n > 1. Those with n > 1 are called joint factors 
by Hinkelmann and Kempthorne [23], Section 4.12.1. If a generalized factor 
contains Fi and is to be meaningful, then it must also contain every Fa which 
nests Fi. We consider only such generalized factors, calling them intrinsic. 
In addition, we use the trivial factor Universe, which has a single level. 

The factors in a tier are partially ordered by the nesting relationship in 
[28], so they form a partially ordered set, or poset [10]. When there is one 
object in the set of objects for each combination of the levels of these factors, 
then the ordered pair consisting of the set of objects and the collection of 
intrinsic generalized factors derived from the poset is called a poset block 
structure [2, 3, 6, 7]. In this case, there is one idempotent for each intrinsic 
generalized factor, and we call this collection of idempotents a poset struc- 
ture. 

Sometimes, in addition to the tier, pseudofactors are needed to describe 
the randomization. (The introduction of pseudofactors has no effect on the 
size of the set of objects.) As in [14], if F is a factor or generalized factor, 
then each level of a pseudofactor for F corresponds to one or more levels 
of F. The levels of a pseudofactor have no inherent interest. Pseudofac- 
tors can be combined with other factors and pseudofactors into generalized 
pseudofactors. In these cases, V may include idempotents derived from the 
pseudofactors. 

The generalized factors and pseudofactors can be shown on a Hasse di- 
agram [2, 3, 8, 9, 37], that shows the important marginality relationships 
between them [6, 12, 31, 33, 37]. For generalized (pseudo) factors G and H, 
we say that H is marginal to G if each level of G occurs with only one 
level of H but G is not aliased with H. For poset block structures, H is 
marginal to G if the factors in H are a proper subset of those in G. In the 
Hasse diagram there is a small circle for each generalized (pseudo)factor G; 
the number uq of levels of G is shown beside it. The full factor name is 
sometimes abbreviated to its first letter so long as it has been given in full 
at a higher point in the diagram. If H is marginal to G then H is drawn 
higher than G and joined to it by downward lines. At the top of the diagram 
is the factor Universe, which is marginal to every other generalized factor. 

Corresponding to each generalized (pseudo)factor G is an averaging oper- 
ator A^. This is an f2 x Q matrix whose (a, /3)-entry is if the objects a and 
(3 have different levels of G and otherwise is the reciprocal of the replication 
of the shared level of G. Thus, if y 6 Vq, then the w-entry of A^y is the 
average value of y a for elements a of f2 with the same level of G as u. In 
particular, Adverse = |^| _1 J where J is the JlxS] all-ones matrix. 

Also associated with each generalized factor G is a source: this is the sub- 
space of Vq for differences between the levels of G that are not accounted for 
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by generalized factors marginal to G. Similarly, there is also a pseudosource 
associated with each generalized pseudofactor. Many different notations are 
in use for nesting and interacting factors in sources — Heiberger [22], Sec- 
tion 12.4, compares several. Here we use a notation that is reasonably in- 
tuitive and is unambiguous for poset block structures. In such a structure, 
the factors in a generalized factor G = F\ A • • • A F n can always be ordered 
in such a way that (i) if Fj is nested in Fi then j > i; and (ii) there is some 
m <n such that Fi nests another factor in G if and only if i < m. We write 
the source for G as F m+ i# • • • #F n [F\ A • • • A F m ] if m > 1; otherwise as 

The collection of sources and pseudosources specify an orthogonal decom- 
position of Vq, with an orthogonal projector P in V for each (pseudo)source. 
Starting with Puniverse — -A-universej eac h Pg is obtained by subtracting 
from Aq all those P# for which H is marginal to G. Thus the effect of 
any P on any vector is achieved by a straightforward sequence of averaging 
operations, each equivalent to applying one of the A matrices, and sub- 
tractions. Degrees of freedom da are calculated from the numbers uq by 
exactly the same process; they are also shown in the Hasse diagram, with 
their corresponding sources. 

Example 3. If Q consists of bp plots grouped into b blocks of equal 
size, then the panel for it is like the right-hand one in [14], Figure 1, sim- 
ilar to the middle panel of Figure 1. The poset of factors for plots is the 
tier {Blocks, Plots} with the p-level factor Plots nested in the 6-level fac- 
tor Blocks. The set of intrinsic generalized factors derived from this poset 
is {Universe, Blocks, Blocks A Plots}, and the factor Blocks A Plots, whose 
levels are the combinations of levels of Blocks and Plots, uniquely indexes 
the plots. Hence the set of plots and this set of generalized factors form a 
poset block structure. The generalized factors, with their numbers of levels, 
are given on the left in the left-hand Hasse diagram in Figure 3. Also, on 
the right in this diagram are the three sources in the orthogonal decom- 
position of Vpiots, deduced from the generalized factors, with their degrees 
of freedom; only the first letter of each factor name is shown. The source 
Plots [Blocks] is the subspace for Plot differences within Blocks. It is obtained 
as differences between all combinations of Plots and Blocks after allowing 
for the subspace Blocks. The poset structure on plots is the orthogonal de- 
composition of the 6p-dimensional Vpiots specified by the set of idempotents 

V = {PMcan)PBlocks>Ppiots[B]}- 

The Hasse diagram on the right of Figure 3, formed using the same process 
as that on the left, gives the expressions for the elements of V . It shows 

that PMcan = -^Universe ; ^Blocks = -^.Blocks — Au n j vcrsc , and Ppiots [Blocks] = 
A-BlocksAPlots — PBlocks _ PMcan = AeiocksAPlots _ Aeiocks = I _ Aeiocks • 
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Universe 
1 

Blocks 
b 

B A Plots 
bp 



Mean 

1 

B 

b- 1 
P[B] 
b(p - 1) 



Universe • Mean 

-A. Universe -A-Universe 

Blocks 1 B 

-A-Blocks -A-Bi oc k s Au n i verse 

B A Plots 1 P [B] 

ABlocksAPlots ABlocksAPlots — Asiocks 



Fig. 3. Hasse diagram for Example 3: on the left, it is used to calculate degrees of free- 
dom from numbers of levels of generalized factors; on the right, it is used to calculate 
idempotents from averaging matrices. 



Suppose that 6 = 2 and p = 3. Then = 6 and let Q = {uj\, u?2, L03, W4, U5, 
loq} be the set of plot labels. Further suppose that the levels of Blocks A Plots 
for the plots are 11, 12, 13, 21, 22, 23, respectively, where the first number 
indicates the Block and the second the Plot. In this case the Six!] Block 
averaging operator is 
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Example 4. If the set of eight treatments is indexed by all combina- 
tions of four trellis treatments with two methods of pruning, then there is 
no nesting. The poset of factors in the panel on the left of Figure 2 gives the 
Hasse diagram in Figure 4. This shows the four intrinsic generalized factors 
and their numbers of levels that, together with the set of treatments, form 
a poset block structure. It also gives the four sources derived from the gen- 
eralized factors, and their degrees of freedom. The source Trellis#Methods 
is the subspace for the interaction of Trellis and Methods. It is obtained as 
differences between all combinations of Trellises and Methods after allowing 
for the subspaces Trellis and Methods. The poset structure on treatments is 
the orthogonal decomposition of the 8-dimensional vector space treatments 
specified by the set of idempotents Q = {QMean, QTrciiis, QMcthod, Qt#m}- 



4. Structure for experiments with a single randomization. Here we re- 
call the results and concepts from Nelder [28, 29] and James and Wilkinson 
[26] for an experiment with a single randomization of a set T of treatments 
onto a set f2 of observational units; we also establish some notation. Al- 
though we speak of randomizing the treatments to the observational units, 
the allocation of treatments to units is actually a function / from to X: 
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Universe: Mean 




T A M: T # M 
8 3 



Fig. 4. Hasse diagram for Example 4, used to calculate degrees of freedom. 

the treatment on observational unit u is just /(w) [1]. This is shown dia- 
grammatically in Figure 5. 

The design function / is the result of deliberate randomization. As dis- 
cussed in [14], Section 2, the randomization is often achieved by applying a 
single permutation (perhaps chosen from a restricted set) to the (labels of 
the) observational units, after an initial systematic plan has been written 
down. The initial plan is carefully chosen by the experimenter from among 
many possibilities. Part of his or her choice is governed by practical con- 
siderations such as the limiting of levels of one treatment factor to large 
parts of like whole plots; part is governed by combinatorial considerations 
such as making an incomplete-block design balanced; part is concerned with 
getting desirable values for the efficiency factors, which are discussed in the 
remainder of this paper. 

As in Section 3, there is a structure V on the |0| -dimensional vector space 
Vh- Similarly, the structure on the |T|-dimensional vector space Vx may 
be represented by a set Q of symmetric, idempotent, mutually orthogonal, 
T x T matrices which sum to Iq, the identity matrix on Vy. The basic 
problem is to obtain a single decomposition of Vh that combines the two 
structures V and Q to produce a single set of orthogonal idempotents that 
we denote by V t> Q. 

We call V > Q the set of idempotents for '"P refined by Q" or '"P de- 
composed by Q." It will be shown that, for structure-balanced experiments, 
defined below, the elements of this set are of two types: P l> Q, called "P 
pertaining to Q," and Ph Q, called "P orthogonal to Q" or "the residual of 
Q in P." For each P in V, these decompose ImP, the subspace of Vh onto 
which P projects, into subspaces of the form ImPQ, for Q € Q, and a sub- 




randomized unrandomized 



Fig. 5. Diagram of an experiment with a single randomization. 
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space that is orthogonal to all of the subspaces ImQ. If, for some particular 
P, all of the subspaces ImQ are orthogonal to ImP, then Ph Q = P. 

However, a little care is needed here because, in general, |f2| > |T|, and 
so the size of a P € V can differ from that of a Q 6 Q. Clearly, we would 
like all matrices to be O x Q matrices. To achieve this requires the mapping 
of the space Vy into an appropriate subspace of Vh. Now, the function / is 
often represented by the $7 x Y design matrix X, whose (u,t) entry is equal 
to 1 if f{ui) = t and to otherwise. The column space of X, which we shall 
temporarily call Vy, is the required subspace of Vh; it is isomorphic to the 
space Vy. 

Next we have to turn Q into a structure Qf on v£ ■ If generalized factor Gi 

f 

has averaging matrix A, on Vy, then its averaging matrix on Vh is 

X(AjX'XAj)~X'. In order for generalized factors Gi and Gj on T to remain 

f f f f 

orthogonal when regarded as factors on Q, we must have A;- Aj = Aj A;- ; 
that is, 

X(A i X'XA^X'X(A J X'XA J )"X' 

(4.1) 

= X(A i X'XA j )-X'X(A,;X'XA ! )-X'. 

This is equivalent to the well-known "proportional meeting" condition [6, 
9, 37], which is always satisfied if / is equireplicate. We assume that (4.1) 
holds. Then, like P in V, the projectors in Q? can be calculated from 
the A^ by suitable subtraction. However, if / is not equireplicate then is 
not, in general, equal to X(QX'XQ) _ X'. 

The matrices Q^, for Q in Q, are symmetric, mutually orthogonal idem- 
potents which sum to Ig. However, Ig is not the $7x0 identity matrix: for 
w in Vh , we have Ig w = w if w 6 Vy , but Ig w = if w is orthogonal to Vy . 
In fact, Ig = X(X'X)~X', but we prefer to use notation that suggests that 
this matrix is an identity for certain other matrices. 

Henceforth we identify Vx with Vy, Q with Qj and Q with Q-^, on 
the understanding that it is the particular choice of / that specifies which 
subspace of Vh is to be regarded as Vy. Thus we now regard each Q as an 

x matrix. The matrix Ig is a multiplicative identity for each Q, even 
though it is not for every Q x fi matrix. 

Recall that f2 also has a structure V . To be able to combine V and Q 
into a single structure in a unique way, we need V and Q to satisfy the 
condition of structure balance in Definition 1 below. This definition builds 
on the ideas of [29, 30], in which the design is called generally balanced if 
there are scalars Apq for P in V and Q in Q such that 



(4.2) 



QPIg = A PQ Q 
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for all P in V and all Q in Q. (The Apq necessarily lie between and 1.) 
This is equivalent to the conjunction of the following two conditions: 



for all P in V and all Qi, Q 2 in Q with Qi / Q 2 . 

For each fixed Q and P, condition (4.3) says that every vector w in the 
space ImQ makes an angle cos" 1 ^/Apq with the space ImP. The Apq are 
called canonical efficiency factors in [26]; we abbreviate this to efficiency 
factor. For block and row-column designs, they are the same as those ob- 
tained using the reduced normal equations [24, 26]. 

For each fixed P, condition (4.4) says that the orthogonal spaces ImQi 
and ImQ 2 remain orthogonal when projected onto ImP. It is a form of 
adjusted orthogonality [18]. 

For a single P and Q, James and Wilkinson [26] said that Q has first- 
order balance in relation to P if condition (4.3) holds. However, they did 
not require that P and Q be members of a set of mutually orthogonal, 
symmetric idempotents. 

In [29, 30], general balance is defined for the case that V and Q are simple 
orthogonal block structures, which are those poset block structures that 
can be defined by formulas using crossing and nesting. In their expository 
paper, Houtman and Speed [24] took V as given, although not necessarily 
defined by factors, and defined the design to be generally balanced if there 
exists any orthogonal decomposition Q of Vr such that (4.2) holds. This 
offended writers such as Pearce [35], who do not like the idea that some 
designs, such as all incomplete-block designs, can be made generally balanced 
by using a Q that is a meaningless decomposition. Payne and Tobias [33] 
expressed a strong preference for general balance to be a property of a 
specific Q. Brien [13] defined the design / to be structure balanced if, for 
given structures V and Q, condition (4.2) holds; Bailey [5] called this general 
balance with respect to V and Q. They had in mind that V and Q would 
often be determined by factor and pseudofactor relations as described in 
Section 3, which is the case for all examples in this paper. 

The randomization of T to includes the random choice of one of the 
permutations of £1 which preserve the structure P on O. Thus V is inherent 
in the randomization. However, there can be a choice of structure on T. It 
may be that the natural structure Qi, as defined by factors and marginal- 
ity, is not itself structure balanced in relation to V, but has a refinement 
0,2 involving pseudosources that is. If Q in Qi is partly confounded with 
more than one P, then, either Q has first-order balance in relation to ev- 
ery P, or Q needs to be decomposed into pseudosources Qi,...,Q m , say, 




QPQ = A PQ Q 



QiPQ 2 = 
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in such a way that each Qj has first-order balance in relation to every P. 
Pseudosources play the important role of decomposing some sources into 
pseudosources and a remainder, so that Q2 is structure balanced in relation 
to V. Compared to other possible decompositions of the sources, the advan- 
tage of pseudosources defined by pseudofactors is that all projections can 
still be obtained by averaging and subtracting. Some such pseudofactors are 
used in randomizing the design (as in the second phase of Examples 2 and 5); 
others are introduced only as an aid to the analysis (as in a group-divisible 
block design). 

For clarity, we shall use the following definitions, which apply to all struc- 
tures, whether or not they are natural, and whether or not they are defined 
by factors and pseudofactors. 

Definition 1 . A structure Q is structure balanced in relation to a struc- 
ture V if there are scalars Apq for P in V and Q in Q such that 

(i) QPQ = A PQ Q for all P in V and all Q in Q, and 

(ii) Q1PQ2 = for all Pin? and all Qi ^ Q 2 in Q. 

The structure Q is first- order balanced in relation to V if (i) holds, and the 
structure Q is orthogonal in relation to V if (i) and (ii) hold with each Apq 
equal to either 1 or 0. 

Of course, these definitions implicitly depend on the function /. It is useful 
to summarize the set of efficiency factors for the relationship between the 
two decompositions in the PxQ efficiency matrix A-pQ. This matrix A-pg is 
equivalent to the V x Q table introduced in [24] to summarize the efficiency 
factors in a generally balanced experiment (see also [7] , Chapter 7) . 

Theorem 4.1 (James and Wilkinson [26]). Suppose that Q has first- 
order balance in relation to P with efficiency factor Apq . If Apq 7^ 0, then 
Apq is the unique nonzero eigenvalue of PQP, and so the matrix of or- 
thogonal projection onto P(ImQ) is ApgPQP; if Apq = 0, then ImQ is 
orthogonal to ImP. 

Definition 2. If P and Q satisfy (4.2) with Apq / 0, then we define 
P D> Q to be Ap Q PQP. 

As previously noted, P > Q could be called "P pertaining to Q." More 
accurately, the source Im(P > Q) is "the part of ImP with which ImQ is 
(partly) confounded." Another way to put it is that it is "the part of ImP 
that contains information about ImQ." 

Lemma 4.1. // Apq = 1 then P > Q = Q and so ImQ < ImP. 
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Proof. If A PQ = 1, then QPQ = Q, and so (PQ - Q)'(PQ - Q) = 
(QP - Q)(PQ - Q) = Q - QPQ = 0. Thus PQ - Q = 0. Hence PQ = 
Q = Q' = QP, and so P > Q = PQP = PQ = Q. Now if w € ImQ, then 
w = Qw, and so Pw = PQw = Qw = w: hence w £ ImP. □ 

If (4.3) and (4.4) both hold for a fixed P and all Q, then the images of 
the P D> Q, for Q in Q with Apq ^ 0, are mutually orthogonal subspaces 
of ImP. This leaves the orthogonal complement in ImP of their sum: this 
space is just (ImP) n V-^ and is often called the residual subspace in ImP. 

Definition 3. Define P h Q to be the projector on the residual sub- 
space in ImP. If all Q in Q satisfy (4.2), it is given by 

(4.5) PhQ = P-^'P>Q, 

QeQ 

where X^QeS means summation over Q in Q with Apq ^ 0. 

In particular, if PI Q = then P h Q = P, while if Pig = P then P h Q = 

0. 

As noted above, P h Q could be called "P orthogonal to Q." More accu- 
rately, the source Im(P h Q) is "the part of ImP that is orthogonal to V*c" 
Another way to put it is that it is "the part of Im P that contains no infor- 
mation about ImQ for any Q in Q." 

Finally, to realize our goal of a single decomposition of Vq using V and 
Q, we collect together the P > Q, for all P and Q, and PhQ, for all P, to 
make the following definition. 

Definition 4. If V, Q are orthogonal decompositions of Vh, Vx respec- 
tively, and a function / : £1 — > T is given such that Q is structure balanced 
in relation to V, then the decomposition V > Q of Vq is defined to be 

{P>Q:PeP,QG Q,A PQ ^0}U{PhQ:PeP}. 

In [29] it is argued that the analysis of data y from the experiment begins 
by projecting y onto the images of the elements of this decomposition. 

Lemma 4.2. If |T| = |Q| and Q is structure balanced in relation to V , 
then V> Q=Q. 

Proof. If |T| = |0| then Iq = I-p. If Q is structure balanced in relation 
to V, then equation (4.2) shows that A PQ QP = QPIqP = QPIpP = QP, 
so Apq is equal to or 1 for all P in V and all Q in Q. Moreover, Pig = 
PI-p = P, so there are no residual sources. Hence Q is a refinement of V in 
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the sense that each subspace in V is a direct sum of one or more subspaces 
in Q. □ 

In [14], Section 8.4, we commented that the direction of randomization 
is arbitrary when the two sets of objects have the same size. Lemma 4.2 
suggests that, in this case, the unrandomized tier should be the one with 
less structure, that is, the one placing fewer restrictions on the choice of 
random permutation. 

GenStat [32] accepts from the user the specification of V and Q via 
the BLOCKSTRUCTURE and TREATMENTSTRUCTURE directives, respectively, al- 
though each of V and Q has only to consist of a set of symmetric idem- 
potents that are first-order balanced in the sense that any two elements 
from different sets meet condition (4.3). It also allows pseudofactors in its 
TREATMENTSTRUCTURE directive. Its ANDVA directive uses the algorithm de- 
scribed by Wilkinson [39] and Payne and Wilkinson [34], first to check for 
first-order balance, and then to decompose y into its projections by the 
elements of V > Q. A similar system, implemented in P-Stat, is described 
in [22] — the Treatment statement specifies Q, and the Block and Error 
statements together specify V. In the programs R [36] and S-Plus [25], the 
decomposition V D> Q is obtained by specifying V as the part of the model 
inside an Error function and Q as the part outside this function. In these 
last two packages, there can even be nonorthogonality between the idempo- 
tents. 

In our examples, we often replace P and Q in Apq by the names of their 
corresponding sources. Similarly, in P > Q and P h Q we replace P and 
Q by the names of their sources to indicate the subspaces P(ImQ) and 
(ImP) (~l Vf, respectively. 

Example 2 (Continued). The Hasse diagram derived from the middle 
panel in Figure 2 is in Figure 6, where Q denotes Squares. There are seven 
elements of V, one for each source in Figure 6. The sources for the treatments 
tier are in Figure 4, and so Q has four elements. 

The relationship between V and Q is exhibited in the efficiency factors 
between their elements. The nonzero efficiency factors are AMean,Mean = 1, 
A C[Q],T = 1/9, A r#C [q] jT = 8/9, A H [qaraC],m = 1 and A H [qarac],t#m = 1- 
These lead to the decomposition in Table 1, which we have already discussed. 
Efficiency factors are not usually shown in the decomposition table if Q is 
orthogonal in relation to V; otherwise, only the nonzero efficiency factors are 
shown. The decomposition of the vector space indexed by the 48 half-plots 
is specified by the 11 elements in V > Q, one for each line in the table: 

f PMcant>QMcan,PR>Q,PQHQ,P Q#R hQ, 

V>Q=l P C [q]>Qt,P C [q]^Q,Pr#c[q]>Qt,Pr#c[q]^Q, ' 
[ p h[qarac] > Qm,Ph[qarac] > Qt#m,Ph[qarac] I" Q , 
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with 



PMean t> QMean 
PC[Q] > Qt 

PcfQi^Q 

p, 



PMcan — QMean j 

Pr, P Q hQ = P Q: 

(l/^-^qQjQTPqq], 



Q#Rl 



Q = P 



Q#R, 



C[Q] 



>Q T , 



'r#c[q] > Qt = (8/9) _1 Pr#c[q]QtPr#c[q]> 
Pr#c[q] l~ Q = Pr#c[q] - Pr#c[q] > Qt, 
Ph[qarac] > Qm = Qm, Ph[qarac] > Qt#m = Qt#m and 
Ph[qarac] l~ 2 = Ph[qarac] _ Qm - Qt#m- 

5. Structure for composed randomizations. For a pair of composed ran- 
domizations, there are three sets of objects T and T with structures V, 
Q and 1Z, respectively. While the results apply to quite general structures, 
those for our examples will be derived from a tier of factors that together 
uniquely index the set of objects. There is one randomization from T onto T, 
and one from T to fi. Two functions are required to encapsulate the results 
of these randomizations, say / : — > T and g : T — > T. This is represented 
diagrammatically in Figure 7. Of course, there is a third function implied 
here, namely the composite function g o f : £1 — > T. 

Here we extend the results obtained in Section 4 for two structures to 
obtain a single decomposition of Vh that combines the three structures in 
composed randomizations. We will write everything as if it were defined on 
rather than on T or T. In Section 4, we have already justified writing 
as Vy and Q^ as Q when T is randomized to Q, using the function /. When 
r is randomized to T, using the function g, and T is randomized to 0, using 
the function /, then the embedding of Vp in Vq depends on the composite 



Universe: Mean 
1 1 
Squares: Q 
2 1 
Q A Columns: C [Q] 
8 6 




Rows: R 
3 2 
Q A R: Q # R 
6 2 
Q A R A C: R # C [Q] 
24 12 



Q A R A C A Halfplots: H [Q A R A C] 
48 24 



Fig. 6. Hasse diagram for half-plots in Example 2. 
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function g o f but on no further information about how Vr is embedded 
in Vt- Thus we may identify Vr with (V^y . After these identifications we 
have Vr <Vf <V^i. Hence Ig is the identity not only for all Q in Q but 
also for all R in TZ; that is, Rig = IgR = R for all R in TZ. 

Theorem 5.1. Suppose that corresponding to the sets of objects £1, T, 
r there are sets V , Q, TZ of 0, x Q matrices, specifying orthogonal decompo- 
sitions of the subspaces Vq, Vt, Vr, respectively, with Vr <Vr < Vq; that Q 
is structure balanced in relation to V with efficiency matrix A-pg; and that 
TZ is structure balanced in relation to Q with efficiency matrix Aqu- Then: 

(a) TZ is structure balanced in relation to V with efficiency matrix A-pp. = 
ApgAgrc/ 

(b) TZ is structure balanced in relation to V > Q with efficiency matrix 
Ap>Q.TZ whose elements are the efficiency factors Ap[>q r = ApqAqr 
and A P |-g,R = 0; 

(c) Q > TZ is structure balanced in relation to V with efficiency matrix 
A-V,Qr>K whose elements are the efficiency factors Apq>r = Apq and 
Ap,Qh^ = Apq; 

(d) (V D> Q) D> TZ = V > (Q > TZ) = 

{(P>Q)>R:Pe^,Qeg,Reft,ApQ^0,A QR /0} 

U{(P>Q)K7e:PGP,QG Q,ApQ/0}U{PhQ:PG?}. 

Proof, (a) We need to show that RPR = EqsQ ApqAqrR for P in 
V and R in TZ, and that RiPR 2 = for Ri, R 2 in TZ with R x ^ R 2 . 

Since Eo. e gQ = Iq, we have IgPIg = £ QeQ QPIg = Eo. e g A PQQ- 
Also R = Rig. Hence 

RPR = RIgPIgR = RApqQR = J2 A PQ A QR R 

Q Q 

while RiPR 2 = Eq A pq RiQR 2 = 0. 

(b) For TZ to be structure balanced in relation to V \> Q with the given 
efficiency factors, we require the following conditions for all P in V, Q in Q, 
and R, R;, Kj in TZ: (i) R(P > Q)R = A pq AqrR; (ii) Rj(P D> Q)Rj = 
if Ri / Kj , (iii) R*(P h Q)Kj = 0. 

randomized in design 1 unrandomized in design 1 

randomized in design 2 unrandomized in design 2 



Fig. 7. Diagram of an experiment with two composed randomizations. 
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Equation (4.2) shows that Ig(P > Q)Ig = I q A P qPQPI q = A PQ Q. There- 
fore R(P > Q)R = RIq(P > Q)IqR = ApqRQR = A pq AqrR while Ri(P > 
Q)R 2 = A PQ RiQR 2 = if Ri ^ R 2 . This proves (i) and (ii). 

From equation (4.5), R^PR^ — Rj(P h Q)Rj = £' Q R»(P > Q)Rj Hence 
part (a) shows that R;(P h Q)Rj = if Rj ^ R, while R(P h Q)R = 
RPR — J2q ^pq^QrR- = 0. This proves (hi). 

(c) If M is in Q > 1Z then I C M = M = MIg, so MjPMj = MiI fi PI Q Mj = 
Mj Ylo ApqQMj , which is zero unless Mj and Mj have the same Q. If Mj 
and Mj have the same Q then MjPMj = A PQ MjQMj = ApcjMjMj = 
5yApQMjMj. Hence Q D> 1Z is structure balanced in relation to V with 
^P,Q>R = Apq and \p t Q\-n = Apq. 

(d) By (b), we may apply Theorem 4.1 and Definition 2 to the pair of 
structures V > Q and 1Z to obtain the matrix (P > Q) > R of orthogonal 
projection onto (P D> Q)(ImR): 

(P > Q) > R = A P ^A QR (P d> Q)R(P D> Q) = Ap^A Q 1 R (Ap^PQP)R(A p ^PQP) 

Moreover, equation (4.5) gives 

(PoQ)hft = P>Q- (P>Q)>R 

Ren 

Aqr^O 

Since each R is orthogonal to P h Q, we have (P h Q) h 1Z = P h Q. Thus 
Definition 4 gives the decomposition (P > Q) > 7?.. 

Similarly, for the pair of structures V and Q>TZ, part (c) gives 

P D> (Q > R) = A p ^P(Q > R)P = A p ^P(A QR QRQ)P 

and 

P>(Qh^) = ApJ J P('Q- Q>R-)P- 

^ Ren ' 



\ 



QR/O 



Then, by applying equation (4.5) twice, 



Ph(Q>7e) = P- ]T ^ P > (Q > R) - ]T P > (Q h ft) 

RenQeQ, QeQ 

= P- ^P>Q = PhQ. 

QeS 

Hence Definition 4 gives V \> (Q \> 1Z) = 

{P>(Q>R):P€P,QG Q,R€ft,A PQ ^0,A QR /0} 

(5.1) 

U{PD>(Qh^):PGP,QeQ,A PQ /0}U{PHQ:PGP}. 
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It now suffices to show that (P t> Q) > R = P > (Q D> R) and (P > Q) h 
TZ = P > (Q h TZ) for all P in V, Q in Q and R in TZ. Now, QPR = 
QPI Q R = ApqQR, so 

(P > Q) > R = ApjUo^PQPRPQP = Ap^Ao^PQRQP = P > (Q > R). 



(P>Q)hft = P>Q - ]T(P>Q)>R = P[>Q- ^ P > (Q > R) 



As noted in Section 4, if PI Q = then P h Q = P while if Pig = P then 
P h Q = 0. Analogously, if (P > Q)I^ = then (P>Q)hft = P[>Q while 
if (P D> Q)I^ = P > Q then (P D> Q) h TZ = 0. 

The algorithm in [39] for analysing two-tiered experiments obtains the 
decomposition of the data vector y into orthogonal vectors of the form Py 
according to the decomposition V and then further decomposes the result 
according to Q, to yield the combined decomposition V O Q of y. If 7^, is 
structure balanced in relation to V > Q, then we can run the algorithm again 
to obtain the decomposition (V > Q) > TZ of y. However, consideration of 
the order in which the randomizations have been performed suggests that 

V should be decomposed according to the joint decomposition Q > TZ, to 
yield the decomposition V > (Q > TZ) of y. Theorem 5.1(d) shows that the 
decomposition may be performed in either order. 

Example 1 (Continued). The Hasse diagrams for the treatments and 
meatloaves tiers are like those in Figures 4 and 3, respectively, while Fig- 
ure 8 gives the Hasse diagram for the tastings tier. Let V, Q and TZ be the 
structures corresponding to tastings, meatloaves and treatments, respec- 
tively. Both designs are orthogonal. Table 2 gives the decomposition table 
corresponding to the joint decomposition (V > Q) > TZ, or, equivalently, 

V > (Q > TZ). For the former decomposition, first obtain V O Q, shown in 
the two left columns. Then V O Q is decomposed by TZ. 

In this case, every nonzero entry in A-pg and Ag^ is equal to one. There 
is exactly one nonzero entry in each column of Aqu, so the nonzero entries 
in the product A-pgA^ are also all equal to one. Theorem 5.1(a) states 
that Apiz = A-pqAq-ji, so we see that, if the two component designs are 
orthogonal, then so is the composite design, which ignores the middle tier. 
In other words, TZ is orthogonal in relation to V . 

Moreover, the nonzero products of entries of A-pQ with entries of Aqr 
are also all equal to one. Hence Theorem 5.1(b) shows that A-p^g^ also 



Also 



ReTZ 




□ 
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Table 2 

Decomposition table for Example 1 



tastings tier 



d.f. 



meatloaves tier 



d.f. 



treatments tier 



d.f. 



Mean 



Mean 



Mean 



Sessions 



Blocks 



Panellists [S] 



33 



Time-orders [SI 



15 



P#T[S] 



165 



Meatloaves [B] 15 



Residual 



150 



Rosemary 1 
Irradiation 2 
Rosemary#Irradiation 2 
Residual 10 



has all nonzero elements equal to one. In other words, 1Z is orthogonal in 
relation to V > Q, and thus there is no need to show efficiency factors in the 
decomposition table. 

The full decomposition contains nine elements, one for each line in the 
table: 

(PMean > QMcan) > R-Mcan, Ps > Qb, Pp[S] ) Pt[S] > ] 
(Pp#T[S] > Qm[B]) > Rr, (Pp#T[S] > Qm[B]) > Rl, [ 
(Pp#T[S] > Qm[B]) > Rr#i 5 (Pp#T[S] > Qm[B]) I" T^i 
Pp#T[S] I - Q 



{v > Q) > n ■■ 



with 

(PMcan t> QMean) O RMcan = PMcan = QMcan = RMean, 



Universe: Mean 
1 1 



S A Panellists: P [S] 
36 33 




Sessions: S 
3 2 
S A Time- Orders: T [S] 
18 15 



S A P A T: P # T [S] 
216 165 



Fig. 8. Hasse diagram for tastings in Example 1. 
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Universe: Mean 
1 1 
Occasions: O 
2 1 
O A Intervals: I [O] 
6 4 
O A I A Sittings: S [O A I] 
24 18 




Judges: J 
6 5 
O A J: 0# J 

12 5 
O A I A J: I # J [O] 
36 20 
OAlASAJ:S#J[OAl] 
144 90 



O A I A S A J A Positions: P [O A I A S A J] 
576 432 



Fig. 9. Hasse diagram for evaluations in Example 2. 



P s > Q B = Ps = Qb = (Ps > Qb) i- n, 
p p[S] = p p[s] !~ Q> 

Pp[T] = Pp[T] ^~ Q) Pp#T[S] t> Qm[B] — Qm[B]i 

(Pp#T[S] > Qm[B]) > R R = R R> (Pp#T[S] > Qm[B]) > Rl = Rl, 
(Pp#T[S] > Qm[B]) > Rr#I = Rr#I> 

(Pp#T[s] > Qm[b]) ^71 = Qm[b] \-T^ = Qm[b] - Rr - Ri - Rr#i and 

Pp#T[S] l~ Q = Pp#T[S] — Qm[B]- 

Note the occurrence of the three different types of elements in the decom- 
position: (P > Q) > R, (P > Q) h 1Z and P h Q. As demonstrated here, for 
an orthogonal experiment some projection operators for the decomposition 
are just elements of V, Q and 1Z while each of the others is obtained by 
subtracting some of these elements from another one. 

Example 2 (Continued). The panel on the right of Figure 2 gives the 
Hasse diagram in Figure 9. This gives the sources in the left column of 
Table 3. 

Let V, Q and 1Z be the structures corresponding to evaluations, half-plots 
and treatments, respectively. As Q is structure balanced in relation to V, 
and 1Z is structure balanced in relation to Q, Theorem 5.1(a) implies that 
1Z is also structure balanced in relation to V and that the matrix A-pn of 
efficiencies for treatments in relation to evaluations is A-pqAqh. Computing 
this product shows that Trellis is estimated from S[0 A I] with efficiency 
1/27 and from S#J[0 A I] with efficiency 26/27. 

Table 3 gives the joint decomposition (V > Q) > 1Z, which contains 20 
elements. For this decomposition, first obtain V > Q, shown in the left two 
columns, with the nonzero entries of A-pQ included in the middle column. 
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Table 3 

Decomposition table for Example 2 (O — Occasions, I — Intervals, S = Sittings, J = 
Judges, P = Positions, Q = Squares, C = Columns, R — Rows, H = Halfplots, T = 

Trellis, M = Methods) 



evaluations tier 






half-plots tier 






treatments tier 




source 


d.f. 


eff. 


source 


d.f. 


eff. 


source 


d.f. 


Mean 


1 


1 


Mean 


1 


1 


Mean 


1 


O 


1 


1 


Q 


1 








I[0] 


4 














S[0 A I] 


18 


i 

3 


C[Q] 


6 


i 

27 


T 

Residual 


3 
3 








Residual 


12 








J 


5 














0#J 


5 














I#J[0] 


20 


1 
1 


R 

Q#R 
Residual 


2 
2 
16 








S#J[0 A I] 


90 


2 
3 


C[Q] 


6 


2 

27 


T 

Residual 


3 
3 






1 


R#C[Q] 


12 


8 
9 


T 

Residual 


3 
9 








Residual 


72 








P[0 A I A S A J] 


432 


1 


H[Q A R A C] 


24 


1 
1 


M 

T#M 
Residual 


1 

3 
20 








Residual 


408 









Then V t> Q is decomposed by 7Z; the properties of this decomposition are 
summarized in Ap>g7j,. As Theorem 5.1(b) states, the entries of this matrix 
are the products of the entries of A-pg with those of Aqu. For example, 

As[OAl]>C[Q],T = A S [OAl],C[Q]^C[Q],T = | X | = 27 

where Q denotes Squares. These nonzero entries are given in the efficiency 
column of the treatments tier in Table 3. The two elements of (V > Q) D> 1Z 
that correspond to the fourth and sixth lines of sources in Table 3 are: 

( p s[oai] > Qc[Q]) > r t = (| x i) Ps[oai]Qc[Q] r tQc[Q]Ps[oai] 
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randomized in design 1 unrandomized in design 1 

randomized in design 2 unrandomized in design 2 

Fig. 10. Diagram of an experiment with two randomized-inclusive randomizations. 
and 

P S[OAl] l~ Q = P S[OAl] - P S[OAl] > Qc[Q] 

= P S[OAl] - (|) lp S[OAl]Qc[Q] P S[OAl]- 
A full analysis-of- variance table is in [15]. 

6. Structure for randomized-inclusive randomizations. The set-up for 
an experiment with a pair of randomized-inclusive randomizations is the 
same as for composed randomizations, except that the randomization of T 
to £1 uses a design whose properties depend on the outcome of the first 
randomization. The order of the randomizations is prescribed, because the 
outcome of the first must be known before the second can be done. Two 
functions are required to encapsulate the results of these randomizations, 
say g : T — > T and / : — > T. This is shown diagrammatically in Figure 10. 
Note the similarity to Figure 7 for composed randomizations. 

There are structures V, Q\ and TZ\ on Q, T and T, respectively. We 
assume that design 1 is randomized using the group of permutations of T 
corresponding to structure Q\ and that 1Z\ is structure balanced in relation 
to Q\. For randomized-inclusive randomizations, the second randomization 
starts with a systematic plan using information from both the previous tiers, 
and uses the group of permutations of £l corresponding to V . However, unlike 
the situation for composed randomizations, Q\ is not structure balanced in 
relation to V. 

In simple cases, Q\ [> 1Z\ is structure balanced in relation to V, so the 
decomposition V t> (Qi > TZ\) is appropriate, and the decomposition must 
proceed from right to left. Example 1 of Wood, Williams and Speed [40] is 
of this form, as is the randomization in [14], Figure 17. In such cases, the 
second randomization is done using pseudofactors on T of the form Fog for 
some generalized factors F on V: these pseudofactors refine Q\ to Q\ > 1Z\. 
In practice we put Q = Q± \> 1Z\ and regard Q as the appropriate structure 
on T. Sometimes the pseudofactors needed on T for design 2 are defined by 
different generalized factors on T in each replicate. In these cases, Q\ must 
be refined to Q2 by these extra pseudofactors on T, and then in practice we 
put Q = Q 2 . 
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49 Lines 




49 lines 



7 Li, L3, L5, L 



4 Blocks 
► 49 Plots in B - 



7 L2, L4, L6, Lg 
196 plots 



7 Runs in I 
4 Intervals 




196 analyses 



Fig. 11. Randomized-inclusive randomizations in Example 5: lines are randomized to 
plots, which are in turn randomized to analyses; B denotes Blocks, I denotes Intervals; 
Li, . . . , Ls are mutually orthogonal pseudofactors for Lines; Pi and P2 are pseudofactors 
for Plots, determined from different Lines pseudofactors in different blocks. 



In more complicated cases, design 2 uses pseudofactors on T that were 
not used in design 1. These refine 7Z± to IZ2, and must be used to define 
pseudofactors on T that refine Q\ to Q2 ■ Then we put Q = Q2, as in Ex- 
ample 5. 

Writing 1Z for 1Z\ or IZ2 as appropriate, in all three cases we have 1Z 
structure balanced in relation to Q and Q structure balanced in relation to 
V . Hence Theorem 5.1 shows that 1Z is structure balanced in relation to V 
and that Ap-ji = A-pgAg-^.. The orthogonal decomposition of Vq resulting 
from the combination of the structures on V, Q and 1Z is given by either 
(V > Q) > 1Z or V > (Q > 1Z), which by Theorem 5.1(d) are equal. 

Example 5 (A two-phase wheat variety trial). Example 9 in [14] de- 
scribes an experiment that consists of a field phase and a laboratory phase. 
In the field phase, 49 lines of wheat are investigated using a randomized 
complete-block design with four blocks. The produce of each plot is anal- 
ysed using a gas chromatograph in which seven samples can be processed 
per run. In the laboratory phase, blocks are randomized to four intervals. In 
each interval, there are seven runs, in which samples are processed at seven 
consecutive times. The design allocates plots to analyses in such a way that 
the runs and times form a 7 x 7 balanced lattice square on the lines. The sets 
of objects for this experiment are analyses, plots and lines. Pseudofactors 
are introduced for lines and plots in order to define the design of the second 
phase. See Figure 11. 

Figure 12 shows the Hasse diagrams when pseudofactors are included. 
The sources for the structure V are in the top, left-hand diagram. With- 
out pseudofactors, the structure Qi on plots has sources Mean, Blocks and 
Plots [B]. With pseudofactors, the sources corresponding to Q2 are those in 
the top, right-hand diagram. Note the source Plots[B]|_, which consists of all 
contrasts for Plots orthogonal not only to Blocks, as for Plots [B], but also 
to the pseudofactors Pi and P2. Similarly, the natural structure 1Z\ on lines 
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has just the sources Mean and Lines, but the pseudofactors Li, . . . , Ls refine 
this to the structure 7^2 whose sources are in the bottom diagram. Reduce 
IZ2 to 7^3 by letting LinesR be the sum of the sources for Li, L3, L5 and L7, 
and Linesx the sum of the sources for L2, L4, Lg and Ls. 

Now, 1Z\ is structure balanced in relation to Q\, and TZ^ is structure 
balanced in relation to both Q\ and Qi. The three combined decompositions 
are in Table 4. In the third one, there are two sources involving LinesR 
because IZz is not orthogonal in relation to Qi- In fact, Ap 1 [B],Lines R = 1/4- By 
Lemma 4.1, P[B] D> LinesR = LinesR. Because Pi[B] > LinesR is a subspace of 
Pi[B] and they have the same dimension, Pi[B] > LinesR = Pi[B]. However, 
Pi[B] ^ LinesR, because the relevant efficiency factor is not 1. Lemma 4.2 
shows that any structure on plots which is structure balanced in relation 
to V must be a refinement of V. However, P[B] > Lines is not contained 
in any of the sources for V; nor is P[B] [> LinesR. Hence, of these three 
decompositions, only Q2 > TZ% is structure balanced in relation to V . This 
is why the pseudofactors for lines and plots are chosen as they are. 

The full decomposition is in Table 5. 

The first two examples in [40] involve randomized-inclusive randomiza- 
tions. This is not immediately obvious in [40], because the pseudofactors 
remain implicit, as do extra factors, such as Replicates, that refine Q. 



Universe: Mean 
1 1 



Universe: Mean 



1 1 




I A R A T: R # T [I] 
196 144 



B A Plots: Plots [B] 
196 144 



1 



Universe: Mean 
1 1 



Li: Li 
7 6 




Lg: Ls 
7 6 



Lines: L 
49 



FlG. 12. Hasse diagrams for analyses, plots and lines in Example 5. 
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Table 4 

Three decompositions of the plots space in Example 5 



Qi >TZi 




Qi > K 3 




Q 2 >7Z 3 




source 


d.f. 


source 


d.f. 


source 


d.f. 


Mean 


1 


Mean 


1 


Mean 


1 


Blocks 


3 


Blocks 


3 


Blocks 


3 


P[B] > Lines 


48 


P[B] > LinesR 


24 


Pi[B] E> LinesR 


24 


P[B] \-Ui 


144 


P[B] > Linesx 


24 


P 2 [B] E> LinesT 


24 






P[B] hft 3 


144 


Plots [B]h C> LinesR 


24 










Plots [B]h D> LinesT 


24 










Plots [B] h hTZ 3 


96 



Table 5 

Decomposition table for Example 5 



analyses tier 




plots tier 






lines tier 




source 


d.f. 


eff. 


source 


d.f. 


eff. 


source 


d.f. 


Mean 


1 


1 


Mean 


1 


1 


Mean 


1 


Intervals 


3 


1 


Blocks 


3 








Runs [I] 


24 


1 


Pip] 


24 


i 

4 


Lines r 


24 


Times [I] 


24 


1 


P 2 [B] 


24 


1 
4 


LinesT 


24 


R#T[I] 


144 


1 


Plots [B]h 


144 


3 
4 

4 


LinesR 
LinesT 
Residual 


24 
24 
96 



7. Structure-balanced experiments with a longer chain of randomiza- 
tions. It is clear that the results of Section 5 can be generalized by induction 
to experiments involving a chain of more than two randomizations so long 
as each successive pair is either composed or randomized-inclusive. For ex- 
ample, multiphase experiments with three or more phases will commonly 
employ three or more such randomizations. Suppose that fl; has structure 
Vii for i = 1, . . . ,p, and that the chain of randomizations goes from £l p to 
£lp-i to ...to Q\. Assume that, if necessary, pseudofactors have been in- 
cluded in each structure, as in Section 6. Then the decomposition is 

{■■■{Vi>V 2 )>---)>V p or Pi>(7> 2 >(--->Pp)-")- 

Theorem 5.1 shows that the operator > is associative, and so the parentheses 
may be omitted. 

The following example has four tiers and three randomizations. It appears 
quite simple, because, in each of the two proposed designs, the relationship 



28 



C. J. BRIEN AND R. A. BAILEY 



(b) 



4 Treatments - 



12 Batches^ 



4 treatments 12 batches 

12 batches 3 B 



48 Socks 



socks 




12 Batches 



4 Treatments 



4 treatments 4 Bi 
S 



48 socks 



4 S 2 




12 Washes 



4 Rides in W 
48 rides 

12 Washes 

*■ 4 Rides in W 
48 rides 



Fig. 13. Two methods of randomization in Example 6: socks are randomly split into 
batches, treatments are randomized to batches, and socks are randomized to rides; Si, . . . , S3 
are pseudofactors for Socks (different in the two methods); Bi and B2 are pseudofactors 
for Batches. 



between each successive pair of structures is orthogonality. Nevertheless, it 
shows how examination of the decomposition tables can aid the experimenter 
to choose among competing designs. 

Example 6 (Knitted socks). Example 14 in [14] concerns an experiment 
in which 48 knitted socks are randomly divided into twelve batches, to which 
four chlorination treatments are randomized. This develops Example 5.9 
from [17], in which various ways of randomizing the socks to rides in washing 
machines are suggested. Figure 13 shows two of these. In both cases, the 
first and second randomizations are composed while the second and third 
are randomized-inclusive. The results of Sections 5 and 6 can be applied 
directly to give the decompositions in Tables 6 and 7. In Table 6, the source 
Socks h Si is the part of Socks that is orthogonal to the source Si. The source 
Socks h (S2 A S3) in Table 7 is defined similarly. The first experiment has the 
advantage that the Residual for Treatments has more degrees of freedom 
but the second has the advantage that the Treatments source is confounded 
with Rides [Washes] rather than the potentially more variable Washes. 



8. Summary. In this paper, the notation V t> Q has been introduced for 
the orthogonal decomposition of Vh for a single randomization, based on 
the two structures V and Q. The idea has been extended to cover composed 
and randomized-inclusive randomizations, which involve three structures, 
V, Q and 1Z. For these two related types of multiple randomization, it has 
been proven that the orthogonal decompositions (V > Q) t> 1Z and V D> 
(Q > TZ) of Vh are the same. As a result, the decomposition may be done 
from left-to-right or right-to- left. The decomposition table for a left-to-right 
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Table 6 

Decomposition table for Example 6: first method 



rides tier 




socks tier 




batches tier 


treatments tier 


source 


d.f. 


source 


d.f. 


source d.f. 


source d.f. 


Mean 


1 


Mean 


1 


Mean 1 


Mean 1 


Washes 


11 


Si 


11 


Batches 11 


Treatments 3 
Residual 8 


Rides [Washes] 


36 


Socks h Si 


36 






Table 7 

Decomposition table for Example 6: second method 


rides tier 




socks tier 




batches tier 


treatments tier 


source 


d.f. 


source 


d.f. 


source d.f. 


source d.f. 


Mean 


1 


Mean 


1 


Mean 1 


Mean 1 


Washes 


11 


S 2 AS 3 


11 


B 2 2 
Residual 9 




Rides [Washes] 


36 


Socks h (S 2 AS 3 ) 


36 


Batches h B 2 9 


Treatments 3 
Residual 6 










Residual 27 





decomposition, when there are three tiers, can be produced using the AMTIER 
procedure in GenStat [16]. 

When it comes to designing experiments and evaluating these designs, we 
have obtained the particularly useful result that, if each design is structure 
balanced, then so is the composite. Further, its matrix of efficiency factors 
is the product of those for the individual designs. 

The question remains as to how the results for multiple randomizations 
that are unrandomized- inclusive, independent, coincident or double [14] com- 
pare with those presented here. It will be addressed in a further paper. 

Acknowledgments. The authors are grateful to the referees for sugges- 
tions that helped to improve the readability of the paper. 
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